Modeling Noise Influence to Speech Intelligibility Non-Intrusively by Reduced Speech Dynamic Range
Abstract
The influence of noise on the speech waveform can be characterized by a reduced speech dynamic range (rDR). This motivated the present work to propose an rDR-based intelligibility measure (denoted rDRm) that predicts speech intelligibility in noise non-intrusively (i.e., without requiring a clean reference speech signal) and is computed using only the dynamic range extracted from the noise-corrupted speech. The rDRm indices were evaluated against intelligibility scores obtained from normal-hearing listeners presented with sentences corrupted by four types of maskers in a total of 22 conditions. A high correlation (r=0.93) was obtained between rDRm values and listeners' sentence recognition scores, and this correlation was comparable to those computed with existing intrusive and non-intrusive intelligibility measures. This suggests that the dynamic range of the speech signal may work as a simple but efficient predictor of speech intelligibility in noise, one whose computation does not need access to the clean reference speech signal.
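The abstract does not give the rDRm formula, so the following is only a minimal sketch of the underlying idea, not the authors' method. It assumes a hypothetical dynamic-range estimate (the spread between the 95th and 5th percentiles of frame-level RMS in dB) and a toy amplitude-modulated signal standing in for speech. The names `frame_levels_db`, `dynamic_range_db`, `toy_speech`, and `add_noise`, as well as the frame length and percentile choices, are illustrative assumptions.

```python
import math
import random


def frame_levels_db(signal, frame_len=256):
    """Return the RMS level in dB of each non-overlapping frame."""
    levels = []
    for start in range(0, len(signal) - frame_len + 1, frame_len):
        frame = signal[start:start + frame_len]
        power = sum(x * x for x in frame) / frame_len
        levels.append(10.0 * math.log10(power + 1e-12))  # small floor avoids log(0)
    return levels


def dynamic_range_db(signal, frame_len=256, low_pct=5.0, high_pct=95.0):
    """Assumed estimate: high-percentile minus low-percentile frame level (dB)."""
    levels = sorted(frame_levels_db(signal, frame_len))

    def pick(pct):
        return levels[round(pct / 100.0 * (len(levels) - 1))]

    return pick(high_pct) - pick(low_pct)


def toy_speech(n=16000, seed=0):
    """Noise carrier with a slow 4 Hz envelope: loud and quiet stretches."""
    rng = random.Random(seed)
    out = []
    for i in range(n):
        envelope = 0.5 * (1.0 + math.sin(2.0 * math.pi * 4.0 * i / 16000.0))
        out.append(envelope ** 2 * rng.gauss(0.0, 1.0))
    return out


def add_noise(signal, snr_db, seed=1):
    """Add white Gaussian noise at the requested signal-to-noise ratio."""
    rng = random.Random(seed)
    sig_power = sum(x * x for x in signal) / len(signal)
    noise_std = math.sqrt(sig_power / (10.0 ** (snr_db / 10.0)))
    return [x + rng.gauss(0.0, noise_std) for x in signal]


clean = toy_speech()
dr_clean = dynamic_range_db(clean)
dr_noisy_5 = dynamic_range_db(add_noise(clean, 5.0))
dr_noisy_m5 = dynamic_range_db(add_noise(clean, -5.0))
print(f"clean: {dr_clean:.1f} dB, +5 dB SNR: {dr_noisy_5:.1f} dB, "
      f"-5 dB SNR: {dr_noisy_m5:.1f} dB")
```

In this sketch the noise fills the quiet stretches of the signal, so the estimated dynamic range shrinks as the signal-to-noise ratio drops, and only the corrupted signal is needed to compute it. Mapping such a value to an intelligibility prediction would require the paper's actual definition and listener data.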
Similar resources
The effect of redesign workstation on Speech Interference Level (SIL) among bank tellers
Abstract Background: There is always an interaction between man and his environment that can be the cause of physical, physiological and psychological stress on people and also cause discomfort, annoyance, and have direct and indirect effects on their performance and productivity, health and safety. People in their workplace are exposed to many factors related to work activities and environmen...
Speech-in-noise enhancement using amplification and dynamic range compression controlled by the speech intelligibility index.
In many speech communication applications, such as public address systems, speech is degraded by additive noise, leading to reduced speech intelligibility. In this paper a pre-processing algorithm is proposed that is capable of increasing speech intelligibility under an equal-power constraint. The proposed AdaptDRC algorithm comprises two time- and frequency-dependent stages, i.e., an amplifica...
Improving speech intelligibility in noise environments by spectral shaping and dynamic range compression
Speech produced under real conditions (not a recording studio, nor a quiet room) is not always equally intelligible due to the presence of background noise. This noise may mask part of the speech signal such that not all speech information is available to the listener. The ability to detect speech in noise plays a significant role in our communication with others. In this work we suggest the us...
Improving speech intelligibility in background noise by SII-dependent amplification and compression
In many speech communication applications it is of great interest to achieve a high intelligibility to ensure good communication. However, in these applications speech is often disturbed by additive noise and/or reverberation. Therefore, it is desirable to develop algorithms that are able to maintain a high intelligibility in such disturbed scenarios. While amplifying the speech to achieve good...
Proceedings of Meetings on Acoustics
While broadband speech may remain perfectly intelligible at levels exceeding 90 dB, narrowband speech intelligibility (e.g., 2/3-octave passband centered at 1.5 kHz) may decline by 25% or more at moderate intensities (e.g., 75 dB). This "rollover" effect is substantially reduced, however, when a speech band is accompanied by flanking bands of white noise [J.A. Bashford, R.M. Warren, & P.W. Lenz...